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Abstract 

We design and analyse approximation algorithms for the minimum-cost connected T-join 
problem: given an undirected graph G = (V, E) with nonnegative costs on the edges, and a 
set of nodes find (if it exists) a spanning connected subgraph H of minimum cost such 

that every node in T has odd degree and every node not in T has even degree; H may have 
multiple copies of any edge of G. Two well-known special cases are the TSP (T = 0) and the 
s,t path TSP (T = {s,t}). Recently, An, Kleinberg, and Shmoys [STOC 2012] improved on 
the long-standing | approximation guarantee for the latter problem and presented an algorithm 

based on LP rounding that achieves an approximation guarantee of ~ 1.61803. 

We show that the methods of An et al. extend to the minimum-cost connected T-join prob- 
lem. They presented a new proof for a | approximation guarantee for the s,t path TSP; their 
proof extends easily to the minimum-cost connected T-join problem. Next, we improve on the 
approximation guarantee of | by extending their LP-rounding algorithm to get an approxima- 
tion guarantee of ^ = 1.625 for all \T\ > 4. 

Finally, we focus on the prize-collecting version of the problem, and present a primal-dual 
algorithm that is "Lagrangian multiplier preserving" and that achieves an approximation guar- 
antee of 3 — \t\-i wncrL \T\ — 4- Our primal-dual algorithm is a generalization of the known 
primal-dual 2-approximation for the prize-collecting s,t path TSP. Furthermore, we show that 
our analysis is tight by presenting instances with \T\ > 4 such that the cost of the solution found 
by the algorithm is exactly 3 — \t\-i times the cost of the constructed dual solution. 

Keywords: approximation algorithms, LP rounding, primal-dual method, prize-collecting 
problems, T-joins, Traveling Salesman Problem, s, t-path TSP. 
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1 Introduction 



The Traveling Salesman Problem (TSP) and its variants, especially the s, t path TSP, are currently 
attracting substantial research interest. We focus on a generalization that captures the TSP and 
the s, t path TSP. 

Let G = (V, E) be an undirected graph with nonnegative costs c e on the edges e € E and let 
T be a subset of V . A T-join is a multiset of edges J of G such that the set of nodes with odd 
degree in the graph H = (V, J) is precisely T, that is, a node v € V has degj(u) odd if and only 
if v G T, [16]. A (spanning) connected T-join is a multiset of edges F of G such that the graph 
.ff = (V, T) is connected and T is the set of nodes with odd degree in H, that is, a node v £ V 
has deg^(u) odd if and only if u 6 T. Clearly, we may (and we shall) assume that G is connected 
and that |T| is even, otherwise, no connected T-join exists; moreover, we may assume that each 
edge of G occurs with multiplicity zero, one, or two in H, otherwise, we may remove two copies of 
an edge from H while preserving the connected T-join property. In the minimum-cost connected 
T-join problem, the goal is to find a connected T-join of minimum cost. Two well-known special 
cases are the TSP (T = 0), and the s,t path TSP (T = {s,t}). 

By a metric graph G we mean a complete graph on V(G) such that the edge costs satisfy the 
triangle inequality. The metric completion of a graph G is given by the complete graph on V{G) 
with the cost of any edge vw equal to the cost of a shortest v, w path of G. It can be seen that 
G has a connected T-join of cost at most 7 if and only if the metric completion has a connected 
T-join of cost at most 7. Thus, we may assume that the given graph G is a metric graph. 

Christofides presented an algorithm for the (metric) TSP that achieves an approximation guar- 
antee of |, and this is the best result known for this problem. Hoogeveen [TU] extended the 
algorithm and its analysis to the s,t path TSP, and proved an approximation guarantee of |. Re- 
cently, An, Kleinberg, and Shmoys pQ improved on this long-standing | approximation guarantee 

and presented an algorithm that achieves an approximation guarantee of ~ 1.61803. To 

the best of our knowledge, there is only one previous result on approximating min-cost connected 
T-joins: Sebd and Vygen [15] present a very nice |-approximation algorithm for unweighted graphs 
(each edge has unit cost); in this context, we mention that the input graph cannot be assumed to 
be a metric graph. Sebo and Vygen [15] were motivated in part by previous advances on the special 
case of T = (namely, the graphic TSP) by Oveis Gharan, Saberi and Singh p3], Momke and 
Svensson [12], and Mucha [13]; in fact, Sebo and Vygen [15] achieve an approximation guarantee 
of I = 1.4 for this special case. 

All of our algorithms follow the plan of Christofides' algorithm: first, compute an appropriate 
tree, then, compute a D-join of minimum cost, where D denotes the set of nodes that have the 
"wrong degree" in the tree; finally, return the union of the tree and the TJ-join. (Here, a Z)-join 
means a multiset of edges E' such that D is the set of nodes of odd degree in (V, E'); throughout 
the paper, we use "T" and "T-join" as in the abstract, that is, T denotes a set of nodes specified in 
the input; we use a symbol different from T for a join with respect to some auxiliary set of nodes.) 

We show that the methods of An et al. extend to the minimum-cost connected T-join problem. 
They presented a new proof for a | approximation guarantee for the s, t path TSP; in Section [3j 
we show that their proof extends easily to the minimum-cost connected T-join problem. More 
interestingly, in Section we generalize the main result of An et al. to obtain an approximation 
guarantee of ^ = 1.625 < | for |T| > 4. Our analysis uses some new methods over that of An et 
al. and we elaborate in the next subsection. 
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Our second batch of results pertain to the following prize-collecting version of the problem: in 
addition to the graph G = (V, E) and the edge costs c, there is a nonnegative penalty ir(v) for 
each node v G V \ T; the goal is to find / C V \ T and a connected T-join F of the graph G \ I 
such that c(F) + tt(I) is minimized. The special case of the prize-collecting TSP (T = 0) has been 
extensively studied for over 20 years, starting with Balas [3], and an approximation guarantee of 
1.91457 has been presented by Goemans [8]; also see Archer et al. [2]. The special case of the 
prize-collecting s, t path TSP (T = {s, t}) has also been studied, and An et al. [1] present an 
approximation guarantee of 1.9535. 

We focus on the general problem (prize-collecting connected T-join) and present a primal-dual 
algorithm that achieves an approximation guarantee of 3 — M—i when \T\ > 4. Our primal-dual 
algorithm may be viewed as a generalization of the known primal-dual 2-approximation for the 
prize-collecting s,t path TSP by Chaudhuri et al. [5], and we also match their approximation 
guarantee of 2 for \T\ = 2. Furthermore, we show that our analysis is tight by presenting instances 
with \T\ > 4 such that the cost of the solution found by the algorithm is exactly 3 — pfpi times 
the cost of the constructed dual solution. 

In fact, the total penalty of the set of isolated nodes / in the solution found by our algorithm is at 
most one times the penalty incurred by the LP solution. Thus, our algorithm has the "Lagrangian 
Multiplier Preserving" property; this property is useful for the design and analysis of approximation 
algorithms for cardinality-constrained versions of problems. 

Our algorithm and analysis follow Chaudhuri et al. [5], and also we follow the well-known method 
of Goemans and Williamson [9] for the prize-collecting Steiner tree problem. One key difference 
comes from the cost analysis for the D-join, where D denotes the set of nodes that have the wrong 
degree in the tree computed by the algorithm. A simple analysis of the cost of this D-join results 
in an approximation guarantee of 4 — 0(|T| _1 ). To get the improved approximation guarantee, our 
analysis has to go beyond the standard methods used for analysing the approximation guarantee 
of primal-dual algorithms. 

Most of our notation is standard, and follows Schrijver [16]; Section [2] has a summary of our 
notation. 

1.1 New Contributions on Min-Cost Connected T- Joins 

This subsection discusses the main points of difference between our analysis and that of An et al. 

Our algorithm and analysis follow that of An et al. at a high level. The algorithm solves an 
LP relaxation, and using the optimal solution x* of the LP, it samples a random spanning tree J, 
and then computes a min-cost -D-join, where D is the set of nodes of the wrong degree in J. The 
analysis hinges on constructing a fractional D-join (a solution to an LP formulation of the D-join 
problem) of low cost to "fix" the wrong-degree nodes in J. 

We construct the fractional D-join as y := a ■ x{J) + ' x * + z where x{J) is the 0-1 incidence 
vector for the edges of J, z is some "correction" vector (described in Section I4.4p , and a and (3 are 
carefully chosen (scalar) values. By the integrality of the D-join polyhedron, the cheapest D-join 
has cost at most the cost of y. By linearity of expectation, the expected cost of y is less than or 
equal to a + /3 times the cost of x* plus the expected cost of z. It turns out that the correction 
vector z is needed only for a special type of cut, the so-called r-narrow cuts: these are given by 
T-odd sets U such that x*(6(U)) < 1 + r. When \T\ = 2, as in An et al. pQ, it turns out that (the 
node sets of) the r-narrow cuts form a nested family U\ C U2 C • • • C U% C This is no longer 
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true for |T| > 4, and hence, the analysis of the correction vectors by An et al. does not apply when 
\T\ > 4. 

We prove that the r-narrow cuts form a laminar family when \T\ > 4. Moreover, in contrast 
with An et al., our analysis hinges on the "partition inequalities" that are satisfied by spanning 
trees and fractional spanning trees such as x* , namely, every partition V = {P\, . . . , P^} of the 
node set into nonempty sets satisfies x*(5(Pi, . . . , Pk)) > k — 1. In our application, we are given 
a subfamily of r-narrow cuts from the laminar family of r-narrow cuts, and we have to obtain a 
partition of the nodeset V into nonempty sets that correspond to the given subfamily. It is not 
clear that this holds for r close to 1, but, we prove that it holds for r < ^. 

To complete the analysis, we have to fix a, (3 and r subject to several constraints, and we have 
to minimize the expected cost of the fractional D-join. We choose r = ^, and this gives a = g, 
(8 = 1; moreover, we get a bound of |cost(x*) on the expected cost of the fractional D-join, and 
thus we get an approximation guarantee of ^ = 1.625. We have an example for |T| =4 showing 
that ^ is the optimal value for r for our methods; see Section [4.31 

2 Preliminaries 

We first establish some notation. Given a multiset of edges F, we use c{F) to denote the cost of 
F; thus, c(F) = ^2 e fieC e ; here, /if denotes the number of copies of the edge e in F. 

For any set of edges F of G, we use x(F) to denote the zero-one incidence vector of F, thus, 
x(F) ^ {0) 1} > an d we use V(F) to denote the set of incident nodes. For any set of edges F of G 
and any subset of nodes S, we use F(S) to denote the set of edges of F that have both endpoints 
in S, and we use 5f(S) to denote the set of edges of F that have exactly one endpoint in S. We 
use the same notation for a multiset of edges. 

For any set of nodes S, let S denote the complement V \ S. A set of nodes S is called T-even 
if \S n T\ is even, and it is called T-odd if \S n Tj is odd. Also, we say that a cut 5f(S) is T-even 
(respectively, T-odd) if S is T-even (respectively, S is T-odd). 

We say that two subsets of nodes R and S cross if R fl S, R U S, R \ S and S \ R are all 
non-empty, proper subsets of V. A family of subsets of V is called laminar if no two of the subsets 
in the family cross. Equivalently, a family of subsets of V is laminar if for any two subsets R, S in 
the family, either R and S are disjoint or one contains the other. 

Let V = {Pi, ... ,Pfi\ be a partition of the nodes of G into nonempty sets P±, . . . , P^. Then 
8{V) denotes the set of edges that have endpoints in different sets in V . 

For ease of notation, we often identify a tree with its edge-set, e.g., we may use J C E(G) to 
denote a spanning tree. Moreover, we may use relaxed notation for singleton sets, e.g., for a node 
i, we may use V — t instead of V \ {t}. 

We use the the next fact throughout the paper. It relates the number of odd-degree nodes in a 
set U C V and the parity of the cut 5(U). 

Lemma 2.1 Let G = (V,E) be a graph, and let T C V have even size. Let F be a multiset of 
edges of G, and let D be the set of wrong -degree nodes w.r.t. F, that is, D consists of nodes v G T 
with deg F (v) even and nodes v € V \ T with deg F (v) odd. Then, for any U C V we have 

(i) \S F (U)\ = \UHDnT\ + \U<lDnT\ (mod 2); 

(ii) moreover, if U is both T-odd and D- odd, then \8f(U)\ is even. 
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Proof. First, we prove (i). Summing over the degrees in F of all nodes in U we have the equation 

E £ + E + E = 2 i F ( [/ )i + \ 5 f{u)\, 

veUnDnr v&unDnT veunDnT v&unBnT 

since each edge in 5f(U) is counted once and each edge in F(U) is counted twice. Now, the degree, 
in F, of each node in U n D n T and J7 fl L> fl T is even and the degree of each node in U H D D T 
and J7 fl D n T is odd. Then (i) follows by reducing modulo 2. 

Now, consider (ii). Since U is both T-odd and D-odd, it can be seen that \U Pi D n T| and 
|J7n D fl T| have the same parity. Then, by (i), |£p(i7)| is even. □ 

2.1 An LP Relaxation 

We will assume that G is a metric graph for both the 5/3-approximation and its improvement. If 
T = 0, then any solution F forms an Eulerian graph H = (V,F); then the standard argument 
of following an Eulerian walk and shortcutting past repeated nodes yields a Hamiltonian cycle of 
no greater cost. Otherwise, if T ^ 0, then the next result shows that there is a minimum-cost 
solution subgraph H = (V, F) that is a spanning tree; the proof follows by generalizing the notion 
of shortcutting an Eulerian walk. 

Proposition 2.2 Let G = (V, E) be a metric graph, and letTCV have even cardinality. Assume 
that T 0. Given a connected T-join F , we can efficiently find a spanning tree of G of cost < c(F) 
that is also a connected T-join. 

Proof. Let F be a connected T-join in G. Suppose that either F has multiple copies of an edge 
of G or F is not acyclic. Then we give a procedure for finding a connected T-join of smaller size 
and no greater cost. This procedure can be repeated until we find a connected T-join that is simple 
and has no cycles. 

Let C := v\V2, V2V3, . . . , Vk-\Vk,VkV\ be a cycle in F for k > 2, where the case k = 2 means we 
are considering two copies of an edge V1V2 in F. We first claim that there is another edge in F apart 
from V1V2, • • • , Vk-iVk,VkVi that has at least one of these vi as its end node. If V \ {v±, . . . , Vk} 7^ 
then this is true because F is a connected T-join. Otherwise, Vi € T for some 1 < i < k since 
T ^ 0. But Vi has degree two using the edges in C and odd degree in the T-join F, so there is some 
edge in F incident to Vi that has not been included in C. 

Suppose uvi is an edge in F that is not listed among the edges in C. Remove uvi and tw+i 
from F and add uvi + \ to F (where we let vi + \ denote v\ if i = k). If u = 1^+1, then we simply 
remove uvi and tw+i without adding any edges. Denote the resulting multiset of edges by F' . By 
the triangle inequality, we have c(F') < c(F). 

The parity of the degrees of the nodes does not change, so F' is still a T-join. Furthermore, 
we claim that the graph H = (V,F') is connected. To see this, observe that H = (V,F f ) has a 
walk W' between a pair of nodes v, w if and only if (V, F) has a walk W between v, w, because 
any occurrence of in W could be replaced by the sequence of edges given by C \ {viVi + ±}, 

similarly, uvi could be replaced by uvi + ±, C \ {i^+i}, and any occurrence of uvi + \ in W' could be 
replaced by uvi,ViVi + \. 

This completes the proof: in a metric graph, given a connected T-join that has cycles or multi- 
edges, we can find a connected T-join of smaller size and no greater cost, assuming T ^ 0. □ 
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Let F be a connected T-join and consider any T-even subset of nodes S. Observe that |<5f(<S)| 
is even; this follows by applying Lemma 12.11 to F and noting that the set of wrong-degree nodes 
D is empty. This fact and Proposition 12.21 lead to our linear programming relaxation (L.P.I) for 
the minimum-cost connected T-join problem. The optimal value of (L.P.I) gives a lower bound on 
the minimum cost of a connected T-join, because there exists an optimal connected T-join whose 
incidence vector satisfies all the constraints of (L.P.I). 
(L.P.I) minimize: J2e&E c e x e 

subject to : x(E(S)) < \S\ - 1 VS C V, \S\ > 2 
x(E(V)) = \V\-1 

x(6(S)) > 2 V0 C S C V, \S n T| even 

x e > Ve E E 

The preceding discussion shows that the optimal value of this linear program is a lower bound 
for the optimal cost for the connected T-join problem when T ^ 0. Using the ellipsoid method, 
we can solve this linear program efficiently. The first two constraints assert that a feasible solution 
x must be in the spanning tree polytope and these can be separated over efficiently (see [TT]). 
The last constraints say that the total x-value assigned to edges crossing any particular T-even cut 
should be at least 2. An efficient separation oracle for these constraints was developed by Barahona 
and Conforti [3J. 

Finally, we recall a linear programming formulation for the minimum cost T-join problem, 
assuming nonnegative costs. The extreme points of this LP are integral, see [16], meaning that the 
optimal value of this LP is equal to the minimum cost of a T-join. We call any feasible solution to 
the following linear program a fractional T-join. 
(L.P.2) minimize : J2 e &E c e x e 

subject to : x{5{U)) > 1 W C V, \U D T| odd 
x e Ve € T 



3 A |- Approximation Algorithm 

Hoogeveen |10] showed that Christofides' 3/2-approximation algorithm for the TSP (the case when 
T = 0) extends to give a 5/3-approximation algorithm for the s, t path TSP (the case when 
T = {s, t}). Later, An, Kleinberg, and Shmoys (AKS) [JJ proved that the 5/3-approximation 
guarantee holds with respect to (the optimal value of) an LP relaxation for the s, t path TSP. 

It turns out that Christofides' algorithm generalizes to give a 5/3-approximation algorithm for 
the min-cost connected T-join problem; this is observed in [15]. The (generalized) algorithm first 
computes a minimum spanning tree J C E(G). Then let D denote the set of "wrong degree" nodes 
in J. That is, D consists of the nodes in T that have even degree in J and the nodes in V \ T that 
have odd degree in J. Let M C E(G) be a minimum-cost D-join. Then the multiset F = JUM (F 
has two copies of each edge in JC\M) forms a connected T-join. Thus the algorithm is combinatorial 
and does not require solving any linear programs. The next result uses the method of An et al. 
to show that the algorithm achieves an approximation guarantee of 5/3 w.r.t. the optimal value of 
the LP relaxation (L.P.I); we include the proof, since it serves as an introduction to our improved 
approximation algorithm that is presented in the next section. 

Theorem 3.1 (An, Kleinberg, and Shmoys |lj) Let x* be an optimal solution for the linear 
programming relaxation of the connected T-join problem, (L.P.I), and let OPT^p denote the opti- 
mal value X^eS-B c e x t- Then the solution F computed by the algorithm has cost < ^OPT^p. 
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Proof. The first two constraints of the linear program ensure that any feasible solution x 
is contained in the spanning tree polytope of G, that is, x is a convex combination of zero-one 
incidence vectors of spanning trees of G, [16]. Let J be a minimum spanning tree; then, we have 
c(J) < OPT LP . 

Let y := 3 x(J) + ^ x *'-> we claim that y is a fractional D-join. By the integrality of the D-join 
polyhedron, this would show that the cost of the D-join M is < ^OPTlp, and hence, the cost of 
F is < \OPT LP . 

To see that y is a fractional D-join, consider any set of nodes U that is D-odd. If U is also 
T-odd, then Lemma 12.11 part (ii) implies that is even; moreover, J is connected, hence, 

|<5j([/)| has size > 2. Also, we have x*(5(U)) > 1, hence, we have y(5(U)) > 1. Otherwise, if U is 
T-even, then x*(5(U)) > 2 by the last constraints of the linear program, and J n 5(U) has size > 1 
since J is connected. Thus we have y(8(U)) > 1 in this case as well. Hence, y(5(U)) > 1 holds for 
every D-odd set U C V, therefore, by (L.P.2), y is a fractional D-join. □ 

4 An Improved Approximation For \T\ > 4 

In this section, we improve on the approximation guarantee of 5/3 for the mincost connected T-join 
problem, by extending the approximation algorithm and analysis by An et al. p], for the s,t path 
TSP. We assume |T| > 4, and we prove an approximation guarantee of ^ = 1.625. (We note that 
the analysis in pQ for the case \T\ = 2 applies also to the linear program (L.P.I); there is a minor 
difference between the two LP relaxations since (L.P.I) does not have degree constraints for the 
nodes; but, the degree constraints in their LP are only required in their analysis to show that their 
LP solution is a convex combination of spanning trees.) 

Theorem 4.1 There is an algorithm (described in Section \4-l\ ) that finds a connected T-join F of 
cost at most ^ times the optimum value of linear program (L.P.I). 

4.1 The Algorithm 

Let x* denote an optimal solution to the linear programming relaxation for the minimum-cost 
connected T-join problem. The first two constraints of the LP allow us to decompose x* as a 
convex combination of incidence vectors of spanning trees. That is, there exist spanning trees 
Ji,...,Jk and non-negative values Ai,...,Afc summing to 1 such that x* = Yli=i xW)- By 
Caratheodory's theorem, we may assume k < \E\ + 1 and it is possible to find these spanning trees 
in polynomial time, |16j . For each spanning tree Jj, let Di denote the set of nodes that have the 
"wrong" degree in Jj, that is, Di consists of the nodes in T that have even degree in Jj and the 
nodes in V \ T that have odd degree in Jj. Let Mi be a minimum cost Dj-join and let Fi be the 
multiset formed by the union of Mj and Jj. Clearly, each Fi is a connected T-join. We output the 
cheapest of these solutions. 

It is easier to analyze a related randomized algorithm. Rather than trying every tree Jj, our 
algorithm randomly selects a single tree J by choosing Jj with probability Aj. Since the deterministic 
algorithm tries all such trees, the cost of the solution found by the deterministic algorithm is at 
most the expected cost of the solution found by this randomized algorithm. Let D denote the set 
of nodes of wrong degree in J, M denote the minimum-cost Z?-join, and F denote the (multiset) 
union of M and J. The randomized algorithm returns F. 
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The expected cost of F is the expected cost of J plus the expected cost of the Z?-join M. The 
expected cost of the tree J is precisely the cost of x* since each edge e has probability precisely x* 
of appearing in J. We will show that the expected cost of M is at most | times the cost of x*. 

4.2 Constructing the Fractional D-Join 

As in the proof of the 5/3-approximation guarantee, we will construct a fractional D-join. However, 
instead of using exactly 3 of x(J) and 3 of x*, we will construct the fractional TJ-join as y := 
a " x(J) + ■ x* + z where x* £ M) e ^, z is some "correction" vector in Wv E \ to be described below, 
and a and /3 are values which will be specified shortly. Again, by the integrality of the T-join 
polyhedron, the cost of M will be at most the cost of y. By linearity of expectation, the expected 
cost of y will be exactly a + /3 times the cost of x* plus the expected cost of z. 

The following lemma shows that for certain a and /3, the correction vector is not needed for 
many cuts. The proof is similar to a result in pQ. 

Lemma 4.2 Suppose a + 2/3 > 1. Then (a ■ x(J) + P • X *)(S(U)) > 1 if U is either 

(i) T-even, or 

(ii) T-odd and D-odd, with x*(5(U)) > ±=M. 

Proof. First, suppose U is T-even. Then x*(5(U)) > 2 by the LP constraints. Since J is 
connected, then | J n 5(U)\ > 1. Therefore, we have a ■ x(J)($(U)) > a and /3 • x*(5(U)) > 2/3; the 
sum of the two terms is > a + 2/3 > 1. 

Now, consider part (ii). Suppose that U is T-odd with x*(S(U)) > 1 ~g 2a . Since U is both 
T-odd and -D-odd, Lemma 12. II part (ii) implies that \J n 6(U)\ is even; moreover, J is a spanning 
tree, hence, J has > 2 edges in S(U). Consequently, we have a ■ x(J)(d(U)) > 2a, and moreover, 
j3 ■ x*(5(U)) > 1 — 2a by the assumption on x*(5(U)); the lemma follows, since the sum of the two 
terms is > 1. □ 

It will be convenient to fix a particular node t € T. Unless otherwise specified, when discussing 
a cut of the graph we will take the set S C V representing the cut to be such that t S, thus the 
cut will be denoted 5(S), S C V — t. As T-odd cuts of the graph that have small x* capacity will 
be used frequently in our analysis, we employ the following definition. 

Definition 4.3 Let r > 0. A T-odd subset of nodes S is called r-narrow if x*(5(S)) < 1 + r. 

Using this definition, Lemma 14.21 savs that if a + 2/3 > 1 with both a, /3 > 0, then the vector 
a 'X(<^)+/3'£* satisfies all constraints defining the T>-join polyhedron except, perhaps, the constraints 
corresponding to T-odd, r-narrow cuts for r > ±^ - 1. 

An et al. in [JJ, proved that if R and S are distinct r-narrow, T-odd cuts then either S C R or 
R C S. A generalization of this result to connected T-joins is the following. 

Lemma 4.4 If t < 1 and R and S are distinct r-narrow cuts, then R and S do not cross. 

Proof. Assume, for the sake of contradiction, that R and S cross. There are two cases to consider, 
depending on the cardinality of R n S (1 T. If R n S n T is odd, then R\S and S\R are nonempty, 
proper subsets of V that have even intersection with T. But then we have 

2 + 2r > x*(5(R)) + x*{5{S)) > x*(5(R \ S)) + x*{5{S \R))>2 + 2, 
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where the last inequality follows from the LP constraints applied to the T-even sets R\S and S\R. 
However, this contradicts r < 1. 

If, on the other hand, R n S H T is even, then RnS and RU S are nonempty, proper subsets of 
y that have even intersection with T. A similar contradiction can be reached in this case using the 
inequality x*(5(R)) + x*{5(S)) > x*{S(Rn S)) + x*{5(RU S)), where we have ^ RnS,RUS 
because R, S cross. □ 

Another way to state Lemma 14.41 is that the r-narrow, T-odd cuts of the graph form a laminar 
family C of nonempty subsets of V \ {t}. 

The correction vector z that we add to a ■ x(J) + f3 ■ x* for the T-odd, r-narrow cuts can be 
constructed from the following lemma. The main difference from the analogous result in pQ is that 
we require a further restriction on the size of r. 

Lemma 4.5 Let C = {U{\ be the laminar family of T-odd, T-narrow cuts. For r < ^ there exists 
vectors f u G JR} E ^, one for each cut Ui G C, such that the following three conditions hold. 

1. For each U e £, f u > 

3. For each U G C, f u (5(U)) > 1 

The proof of this lemma is deferred to the next section. Assuming this lemma, we will now 
show how to complete the analysis of the algorithm. We now fix r to be ^. We also set a := ^ and 
(3 := |. For these choices of parameters, we have a + 2/3 > 1 and r = l ~^ a — 1. 

We construct the correction vector z by including an appropriate multiple of f u for each D-odd 
cut U £ C Formally, 

z= Yl a-2a-(3x*(5(U)))-f u . 
\ur\D\ odd 

Since x*{5(U)) < 1 + r and r = - 1, we have 1 - 2a - f3x*(5{U)) > for each U G £ which 
shows z > 0. From this, Lemma 14.21 shows that y(5(U)) > 1 for each D-odd, T-even cut U and 
each D-odd, T-odd cut U that is not r-narrow. Finally, if U is D-odd, T-odd and r-narrow (so 
U G £), then f u (5{U)) > 1 so y(6(U)) > 2a + f3x*{5{U)) + (1 - 2a - l3x*{5{U))) = 1. Thus, we 
have proved the next result. 

Lemma 4.6 The vector y is a fractional D-join. 

We conclude the analysis by bounding the expected cost of y. The next result states that the 
probability that a T-odd cut U is also D-odd is < x*(6(U)) — 1; this is an immediate extension of 
a similar statement in [1]. 

Fact 4.7 Let U be a T-odd set. Suppose that J is a random spanning tree (obtained from x* by 
choosing Ji with probability Xi). Then Pr[|D D U\ is odd] < x*(5(U)) — 1. 

Therefore, 

E[cost(y)] = (a + 0) cost(x*) + ^(1 - 2a - Px*(U)) ■ Pr[|D n U\ is odd] • cost(/ c/ ). 

U£C 



S 



Now, for each U € £ we can bound (l-2a-(3x*(6(U)))-~Pr[\DnU\ is odd] by (l-2a-/3s*(<J(f7)))- 
(x*(6(U)) - 1). This is -^W^)) 2 ^*^))- 3 , For X *(5(U)) bound between 1 and §, the maximum 
value of this function is achieved at x*(S(U)) = | and its value is 

So, the expected cost of y is at most (a+(3)-cost(x*) + -^ 'Yluec cos ^(f U )- Since YlutC / ^ x *i 
we have the final bound on the expected cost of y being (a + /? + 55) cost(x*). Adding this to the 
expected cost of J, we have that the expected cost of the connected T-join is at most ^cost(x*). 
Note that this is strictly less than |. 

4.3 Tight Example for r 

Here, we present an example for \T\ = 4 showing that ^ is the optimal value for r for our methods. 

Let G = (V, E) be the complete graph on four nodes K4, and let T = V . It can be seen that 
x G IR'^I with x e = ^, Ve G E, satisfies all the constraints of the LP relaxation (L.P.I). Choose any 
one node to be t; recall that for any cut 5(S) of the graph, we assume that the set S representing 
the cut is a subset of V — t. Suppose that we choose a value strictly greater than ^ for r. Then we 
have four T-odd, r-narrow cuts, namely, the cuts of the three singletons S = {v}, v G V — t, and 
the cut of S = V — t; each of these cuts 5(S) has x(5(S)) = | < 1 + r. Clearly, Lemma H3] does 
not apply, because the sum of f s (S(S)) over the four r-narrow cuts has to be > 4, but we have 
x(E) = 3, hence, part 2 of Lemma [4.51 cannot hold. On the other hand, the lemma holds for t = \- 

4.4 The Correction Vector 

We complete the analysis by proving Lemma 14.51 As in [1] , we set up a flow network and use 
the max-flow/min-cut theorem to ensure a flow exists with the desired properties. However, our 
analysis is complicated by the fact that the sets in C are laminar rather than simply nested. 

Our argument on the existence of the desired flow uses the following inequality for spanning 
trees. For a connected graph H and a partition of V(H) into k non-empty sets, V = {Pi, . . . , Pk}, 
the number of edges that have endpoints in different sets in V is at least k — 1, that is, {^(h)^)] ^ 
k — 1. Thus, as our vector x* is a convex combination of (incidence vectors of) spanning trees, we 
have x*(5(Pi, . . . , Pk)) > k — 1, for any partition P\, . . . , P^ of V(G) into nonempty sets. 

Let £' be a subfamily of C. For U E let gc(U) be the nodes in U that are not found in any 
smaller subset in £ . That is, 

gc ,(U) = {v e U : v £ W for any W G £' with W C U}. 

The following result is the key to generalizing the argument in [1] to our setting. 

Lemma 4.8 Suppose that r < \. Let C! be any subfamily of £ The family of subsets {gc(U) : 
U G £'} U {V \ UvKe£' ^} forms a partition of V , and each such subset is nonempty. 

Proof. Each node v in some subset in the family £ is in gc{U) for some U G £ since v is 
"assigned" to the smallest subset of £ containing v. All other nodes appear in the set ^\Uwe£' 
By construction, the sets are disjoint. It remains to prove that each of the sets is nonempty. 

Since t is not in any subset in the family £ , it must be that ^\Uwe£' W For a set U G £ ', 
let mc{U) be the maximal proper subsets of U in the subfamily £ . That is, W G £ is in mc(U) if 
W C U and no other subset W G £ satisfies W <^W <^U. Note that g c >(U) = U\\J Wem , (t/) 
and the sets in m£/(C/) are disjoint. 
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For the sake of contradiction, suppose that gc(U) = 0. Then U is the disjoint union of the sets 
in m£'(U). Since every set in C' is T-odd, then \mc(U)\ is also odd and we let 2k + 1 = \mc(U)\. 
Note that 2k + 1 > 3 which implies k > 1. 

Now we examine the quantity X = x*(5(U)) + ^iyem £ /(c/) a?*(5(W)). One the one hand, since 
U and each W G m c >(U) are r-narrow cuts, then X < (1 + r) + (2k + 1)(1 + r) = (2fc + 2)(1 + r). 
On the other hand, we consider the partition V = {W : W G Tri£i(U)} U {V \ f } of V. We claim 
that 2x*(5('P)) < X. To see this, notice that any edge e with ends in V \ U and Wo for some 
Wo G rri£i(U) is counted twice in X. (Once for 5(f) and once for 5(Wo).) Similarly, for any edge e 
with ends in different subsets Wo, W\ in mc(U) is also counted twice. (Once for 5(Wo) and once for 
5(Wi).) By the partition inequality, we have 2(2fc + 1) < 2x*(5(V)) < X < (2k + 2)(1 + r). Thus, 
2(2k + 1) < (2k + 2)(1+ r) which implies r > ^ > 2 since > 1. This contradicts r < |. □ 
Proof of Lemma 14.51 We now finish construction of the vectors f u , U G C by describing the 
flow network. Create a directed graph with 4 layers of nodes, where the first layer has a single 
source node v s and the last layer has a single sink node vt- We have a node uj/ for each r-narrow 
cut U € C in the second layer, and a node i> e for each edge e 6 E(G) in the third layer. For each 
U E C, there is an arc from v s to v\j with capacity 1. For each edge e of G, there is an arc from v e 
to vt with capacity x*. Finally, for each U € C and each e € 5(f) we have an arc from vjj to t> e 
with capacity oo. 

We claim that there is a flow from v s to i>t that saturates each of the arcs originating from v s ; 
this is proved below. From such a flow, we construct the vectors f u for U E C by setting to 
be the amount of flow sent on the arc from v\j to v e (where we use fj/ = if e ^ 5(U)). We have 
/ > and, by the capacities of the arcs entering vt, Ylu&c f — x * • Finally, since each U € £ 
has the arc from v s to vjj saturated by one unit of flow, we have f u (5(U)) > 1. Thus, the vectors 
f u , U € L satisfy the requirements of Lemma l4~5l 

We prove the existence of this flow by the max-flow/min-cut theorem. Let S be any cut with 
v s G S, Vt S. If S contains some node Vjj for U € C but not v e for some e € 8(U), then the 
capacity of S is oo. Otherwise, let Cs denote the subfamily of sets U € C such that the node vy 
representing U is in S. Then the total capacity of the arcs leaving S is at least 

\£\-\£s\+ £ <■ 

ee<5(C7) 
for some U 

Consider the collection of sets Vs ■= {gc s (U), U G Cs} U {V \ {Jwec s W }- From Lemma \£M 
each set in Vs is nonempty and the sets of Vs form a partition of V. 

Next, we claim that e G S(Vs) if and only if e G 8(U) for some U G Cs- Consider an edge 
e G S(Vs)- If one endpoint of e is in V \ Uwe£s * nen ^ ne °ther endpoint lies in gc s (U) where 
U is the smallest set in Cs containing this endpoint. But then e G S(U) because e has exactly one 
endpoint in U. Otherwise, e = uv has u G gc s (U) and t> G <7£ S (W) for distinct sets U,W G £5. 
Suppose, without loss of generality, that either £/CWorf7nW = 0. Then by definition of 
fl , £ s (l / F)) we cannot have v G J7. Therefore, e G 5(f). 

Conversely, if e = to G 5(f) for some f G £5 with, say, u £ U, then it lies in <7£ S (W) where 
W is the smallest set in Cs containing u. Since W C ?7 and v ^ U, then v must lie in a different 
set in Vs- Thus, e G 5(Vs)- 
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This shows 

£ xt = x*(Vs)>\C s \, 

ee5(U) 
for some U&Cs 

where the inequality holds since l^sl = \Cs\ + 1- Therefore, the capacity of the cut S is at least 
\C\. Since this holds for all v s ,vt cuts S, then the maximum flow is at least \C\. Finally, the cut 
S = {v s } has capacity precisely \C\ so the maximum v s ,Vt flow saturates all of the arcs exiting v s . 

□ 

5 Prize-Collecting Connected T-Joins 

We start with a linear programming relaxation of the prize-collecting problem. For notational 
convenience, we define a large penalty for each node in T. We also designate an arbitrary node 
t* € T as the root node. The LP has a variable Zx for each set X C V — t* such that Zx = 1 
indicates that X is the set of isolated nodes of an optimal integral solution; moreover, we have a cut 
constraint for each nonempty subset S of V — £*; the requirement (r.h.s. value) of a cut constraint 
is 1 or 2, depending on whether the set S is T-odd or T-even. 

Let Q denote the T-odd subsets of V — t* and let 1Z denote the non-empty, T-even subsets of 
V — t*. Our LP relaxation is stated below. 

(L.P.3) minimize : ^^c e x e + ir(X)Zx 

e XCV-t* 

subject to : x(S(Q)) > 1 VQgQ 

x(8(R))+ 2Z X > 2 VR£K 

X:XDR,XCV-t* 

x,Z > 

Consider any solution to the prize-collecting connected T-join problem. Let I QV — t* denote 
the set of isolated nodes and let F denote the connected T-join of G \ /; thus this solution incurs 
a total cost of c(F) for the edges in F plus vr(I) for the penalties of the nodes in I. We define 
an integral solution to (L.P.3) by taking Zi = 1, Z$ = for all other subsets S C V — t*, and 
moreover, for each edge e, we take x e to be the number of copies of e used in F. By construction, 
the cost of this solution (x, Z) is equal to c(F) + 

For every Q £ Q, observe that at least one edge of 5{Q) is in F (since F connects the nodes 
in T); this justifies the first set of constraints in the LP relaxation. Now, focus on the second 
set of constraints in the LP relaxation, and consider any one set R G 1Z and its constraint in 
(L.P.3). If R C /, then the constraint is satisfied due to the term 2Zj (in the left-hand side of the 
constraint). Otherwise, if R % I then at least one edge in 6(R) is in F (since F connects the nodes 
in {t} U R \ I); moreover, by Lemma l2~T| \5f{R)\ is even, so at least two edges of S(R) are in F; 
hence, the constraint is satisfied if R <£. I. The above discussion is summarized by the next result. 

Fact 5.1 The optimal value of (L.P.3) is at most the optimal cost of a prize- collecting connected 
T-join. 

The dual of (L.P.3) has a variable vq for each primal-constraint of the first type, and a variable 
Dr for each primal-constraint of the second type; thus, each T-odd set Q C V — t* has a dual 
variable vq, and each T-even set C R C V — t* has a dual variable vr. 
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(L.P.4) maximize : y Q + ^ 2^ 

QeQ Kerc 
subject to : < c e Ve£fi 

5eQU^:ee5(S) 

2y R < tt(X) Via-f 
2/ > 

Consider the dual LP and a feasible solution y; we call an edge e tig/ii if the constraint for e 
holds with equality, and we call a set of nodes X ir -tight if the constraint for X holds with equality. 



5.1 The Primal-Dual Algorithm 

The algorithm proceeds in phases. In each phase, a partition V of V(G) is maintained; some sets 
in this partition are active and some are inactive. Throughout, the set containing the root, t*, is 
taken to be inactive. The initial partition consists of singletons {v} for every v € V. Each of the 
sets {v},v € V — t*, is designated as active. We initialize ys := for every subset S of V. Let F 
denote the set of edges chosen during the growing phase of the algorithm; we initialize F := 0. 

Each phase proceeds as follows. We simultaneously raise ys for every active set S in the current 
partition at a uniform rate. (Recall that sets containing t* have no dual variables. Since the 
algorithm designates such sets as inactive, it never uses dual variables of such sets.) The phase 
ends when either (i) an edge becomes tight or (ii) an active subset of nodes S becomes -zr-tight. If 
the former occurs, then we pick any edge e = vw that becomes tight; its endpoints v and w must 
be in different components of the current partition; we add e to F, and we merge the components 
in the current partition containing v and w; we call the resulting new component inactive if it 
contains the root, otherwise, we call the new component active. If the latter occurs, that is, if an 
active subset S C V in the partition becomes 7r-tight, then S becomes inactive. The algorithm 
terminates when there are no remaining active sets. 

Standard arguments show that the dual solution at the end of the algorithm is feasible and that 
the set of edges F chosen throughout the algorithm is acyclic. We prune our solution F in the usual 
way. Namely, we iteratively discard any edge e such that there exists an inclusion-wise maximal 
set X that was inactive at some point of the algorithm and 5{X) = {e}; moreover, after this stage 
of pruning, we discard all remaining edges that are not in the component of t*. Let J denote the 
remaining subset of edges. The subgraph that remains after discarding the isolated nodes is a tree 
J containing the root t*. Furthermore, since each node in T has a large penalty, then J contains 
all nodes in T. 

Finally, let D C V(J) denote the set of nodes that have the wrong degree in the tree J. We 
compute a minimum-cost D-join M and finally, we output J U M as a connected T-join on V{ J). 
Let I denote the set of nodes not included in J, thus I = V \ V(J). 



5.2 Analysis of the Primal-Dual Algorithm 

Our argument for bounding the cost of the tree J and the penalties of the nodes in / is similar to 
known arguments. A simple way to bound the cost of the D-join M would be to pair the nodes in 
D using edge-disjoint paths in J, so that adding M to J at most doubles the cost of the set of edges 
used. However, we can improve on this simple analysis of the cost of the D-join by scrutinizing the 
analysis of the dual growing phase. The following theorem summarizes the cost bounds. 
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Theorem 5.2 The penalty of the nodes in I is exactly 2 yx, the cost of the tree J is 



xci 



-( 2 -|rT=i) E^ + 2 E 

v 11 7 QeQ ReK,Rp 



and the cost of the D-join M is 

- ( 1_ whr) E^+ 2 E f«- 

v 11 7 QeQ ReK,Rp 

Let p(|T|) denote the approximation guarantee of our algorithm; below, we show that p(|T|) = 
3 — m_i for \T\ > 4, and p(2) = 2. Before presenting the proof, we remark that this shows 
cost (J U M) + /o(|T|) • 7r(J) is at most times the cost of the dual solution y. 

Proof. The equation for the penalty is standard and follows by construction since I (being the 
union of the 7r-tight inactive components that were pruned) is 7r-tight. The analysis for the cost 
of J is nearly identical to Goemans and Williamson's analysis [9] and is included in Appendix lAl 
for completeness. One minor difference in our analysis comes from the fact that there are at most 
\T\ components that are T-odd at any point in the execution, and we exploit this fact to derive an 
approximation guarantee that is tight on some examples. 

To bound the cost of the D-join M, we consider a possibly different D-join M' obtained by 
pairing the nodes in D with edge-disjoint paths in J. Clearly c(M) < c(M') so it suffices to bound 
the cost of M' . Let J be the subset of J consisting of edges e such that J \ {e} consists of two 
-D-even components. Note that M' n J = since, by parity arguments, any D-join must have an 
even number of edges crossing any D-even cut, and each edge of J is used at most once in M' . The 
next claim is the key to the improved cost analysis for minimal D-joins. 

Claim 5.3 Let Q be a T-odd component from any step in the execution of the algorithm. Then at 
least one of the edges in Sj(Q) belongs to J. That is, 5j\j'(Q) is a proper subset of 5j(Q). 

Proof of Claim We have two cases to consider, either Q is D-odd or it is D-even. First, suppose 
Q is D-odd. Then, by Lemma |2. II part (ii), is even. Focus on J \ 5j(Q) and observe that 

it has an odd number of connected components, so at least one of them, say S, must be D-even. 
Thus, the edge in 5j(Q) connecting Q to S is in J. 

Similarly, if Q is D-even, then |<5j(<3)| is odd. Then J\5j(Q) has an even number of connected 
components, hence, there is another connected component that is D-even, call it S, S ^ Q. Then, 
the edge between Q and S is in J. □ 

Using this, we can bound the cost of M 1 in the following way. 



eSM' eGM' 



/ \ 

^2 VQ+ E VR 

QeQ Ran 

\eeS(Q) R£l,ee5(R) ) 



E \ 6 M'{Q)\VQ + £ \6m>{R)\vr 
QeQ Ren,Rp 



QeQ Ren,Rp ^ 11 'QeQ Ren,R^i 
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The first inequality follows from the claim for the T-odd sets in Q and the simple fact that 5m'{R) Q 
5j(R) for R 6 7Z. The second inequality follows from our analysis of the cost of J in Appendix lAl 

□ 

This completes the analysis of the primal-dual algorithm. Our algorithm and analysis are also 
valid in the case \T\ = 2, and it can be seen that our approximation guarantee for \T\ = 2 is 
p{2) = 2. In fact, our algorithm in this case is essentially identical to the 2-approximation for the 
prize-collecting s,t path TSP presented in [5]. 

Our analysis is tight even up to lower-order terms when |T| > 4. This is realized by a cycle 
on T, that is, G = (T, E) consists of an even-length cycle with at least 4 nodes. Let t* € T be a 
designated node and let the edges incident to it have cost \ while all other edges have cost one. 
The dual growth phase grows y^ v y to 1/2 for every singleton v G T — t*. The algorithm could find 
a tree of cost |T| — | (by picking all edges of G except one of the two edges incident to i*), and 

|X| — 2 ITI — 1 

then find a D-join of cost — ^ — . Observe that the cost of the dual solution is — ^ — , whereas the 
connected T-join constructed by the algorithm has cost 3 ^j~ 5 ; the ratio of these two quantities is 
exactly 3 — j^px- 

6 Conclusions 

We presented a = 1.625 approximation algorithm for the mincost connected T-join problem 
whose analysis closely followed the analysis of the s,t path TSP algorithm in pQ. Furthermore, 
we presented a max{3 — 2}-approximation algorithm for a prize-collecting version of the 

problem; this algorithm is based on the primal-dual method [9] and it is Lagrangian multiplier 
preserving. 

Our algorithms in Sections S] and [5] are based on the LP relaxations (L.P.I) in Section [2] and 
(L.P.3) in Section [5l respectively. Unfortunately, we do not have tight bounds on the integrality 
ratios of these LP relaxations. As far as we know, the best lower bound on the integrality ratio of 
(L.P.I) is |, and this follows from an example for the s, t path TSP in [1] Figure 1]. 

Acknowledgements: We thank a number of colleagues for useful discussions; in particular, we 
thank Jochen Konemann and Chaitanya Swamy. 
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A Appendix: Analysis of the Dual Growing Phase 



We bound the cost of J as follows. 



eSJ eSJ 



VQ+ E VR 

i QeQ Ren 

\eeS(Q) Rp,eeS(R) / 



Qes Ren 

Rp 



The first equation holds because the edges in J are tight. That the inner sum over subsets 
R € 7Z can be restricted to subsets ^ I follows because no subset of nodes contributing to the dual 
constraint for an edge e € J is contained in I. The second equation follows by rearranging the 
sums. 

Now consider a step in the execution with corresponding partition V of V(G). Add the edges 
of J to the graph (V,0), and then contract each of the sets S belonging to the partition V. The 
resulting graph is a tree plus some isolated nodes, because each contracted set S of V induces a tree 
of (V, F) and so the subgraph of (V, J) induced by S consists of a tree plus some isolated nodes, 
see [HldT]. Let C denote the T-odd active sets in V, let C denote the T-even active sets in V which 
are not contained in /, and let X denote the inactive sets B € V with 5j(B) 7^ ("P could contain 
inactive sets B with 5j{B) = 0, but such sets are not relevant for the arguments below). We can 
identify these sets with nodes in the contracted graph. It can be seen that each B € I, except for 
one, has degree at least 2 in this contracted graph by our pruning phase; if a set in X contains the 
root, then its degree could be one, see [T71 Chapter 14.1]. Notice also that \C\ < \T\ — 1 because 
each T-odd active set must contain a node in T — t*. By counting degrees, we have 

2\C\ + 2\C\ + 2\i\-2 = ^IW)l + ElWI + El^( 5 )l 

Qec Rec Bex 



hence, 



E 1 w)i + E iwi ^ 2 i c i + 2 i^i - 1 < ( 2 - jfhr) ' |c| + 2| ^' 



<2 eC ReC 

where the last inequality holds because \C\ < \T\ — 1. Suppose that the dual variables of the active 
sets were raised by A during this phase. Then 

£ A\5j(Q)\ + A M R )\ < ( 2 " tJzT[) A ' C I + 2A ^I- 
Qec m c VII/ 

Since this holds over each phase of the primal-dual algorithm, then by applying induction on the 
number of phases in the execution, we have 

J2\sj(q)\vq+ E N%fl<(2-^ I )E»+ 2 E 

QeQ Ren,R£i ^ 11 ' QeQ R&n,R£i 

This proves the bound on the cost of J. 
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